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ABSTRACT 

Surface ozone (0 3 ) concentrations in excess of 
the National Ambient Air Quality Standard (NAAQS) 
continue to occur in metropolitan areas in the United 
States despite efforts to control emissions of 0 3 precur- 
sors. Future 0 3 control strategies will be based on 
results from modeling efforts that have just begun in 
many areas. Two initial questions that arise are model 
sensitivity to domain-specific conditions and the selec- 
tion of episodes for model evaluation and control stra- 
tegy development. For the Baltimore- Washington 
region (B-W), the presence of the Chesapeake Bay 
introduces a number of issues relevant to model sensi- 
tivity. In this paper, the specific questions of the deter- 
mination of model volume (mixing height) for the 
Urban Airshed Model (UAM) is discussed and various 
alternative methods compared. For the latter question, 
several analytic approaches (cluster analysis and 
classification and regression tree (CART) analysis) arc 
undertaken to determine meteorological conditions asso- 
ciated with severe 0 3 events in the B-W domain. 

1. INTRODUCTION 

Many major metropolitan areas in United States 
have been unable to comply with the NAAQS for 0 3 set 
by the Clean Air Act (CAA). Many explanations have 
been advanced for this (1) failure to reduce emissions 
to extent proposed (rule efficiency); (2) underestimation 
of emissions, including biogenic emissions, and errors in 
the spcciation of hydrocarbons; and (3) overly simplis- 
tic modeling approaches which have overestimated the 
effectiveness of proposed control strategics ( National 
Research Council , 1991). 

The 1990 amendments to the CAA mandate 
further use of photochemical modeling to demonstrate 
that control strategies will result in die future attainment 
of the 0 3 NAAQS. This paper will provide a brief 
overview of the photochemical model in use in the B-W 
domain, the UAM, as well as critical U.S. Environmen- 
tal Protection Agency (EPA) recommendations 
configuring the UAM for regulatory use. In addition, 
model sensitivity questions that have a domain-specific 
facet will be discussed along with the selection of his- 
torical 0 3 episodes for modeling. 

2. THE URBAN AIRSHED MODEL 

The UAM is presently the only EPA-approved 
three-dimensional photochemical model for 0 3 regula- 
tory modeling. The UAM was first developed for use in 


the Los Angeles Basin ( Reynolds , et til., 1973) and the 
1988 edition is in use in the B-W domain ( Morris et ai, 
1990). In order to efficiently manage the numerous 
evaluation and control strategy model runs that arc 
required, the input and archival processes are controlled 
by a software shell specially developed for the UAM 
{Batch Processing System (BPS), John Haus). 

The UAM is a gridded Eulerian model, with 
three-dimensional cells having fixed horizontal and vari- 
able vertical dimensions. The vertical layers vary in 
space and time as a function of the height of the plane- 
tary boundary layer (PBL). generally referred to in 
UAM studies as the "mixing height". The UAM simu- 
lates spatially varying transport, diffusion, chemical 
transformation, deposition and emissions distributions 
within an area by solving the governing equations for 
particular processes. Atmospheric constituents arc 
dispersed evenly within each cell. Surface removal is 
parameterized by deposition velocity which depends, 
generally, on surface concentration, roughness and type. 
The 1988 version of the UAM was updated to include 
the Carbon-Bond Mechanism IV ( Gery , Whitten and 
Killus, 1985) and the Smolarkicwicz algorithm for the 
numerical approximation of horizontal advcclion terms 
in the species conservation equations. 

The size of the domain {Figure 1) is determined 
by the expected distance that local emissions will be 
transported during an average day {EPA, 1991). This 
reflects the relatively slow process of photo-chcmical Q 3 
production. The B-W domain is 250 km north-io-south 
and 230 km east-to-west and contains several unique 
aspects (1) the Chesapeake Bay dominates the center of 
the domain and introduces discontinuities in the surface 
heating and surface roughness fields; (2) the domain 
includes two urban cores and four states which intro- 
duces difficulties in emissions inventory development 
and model initialization; and (3) the extension of the 
domain boundaries to the north, to include portions of 
Greater Philadelphia as well as the Atlantic coast, intro- 
duces boundary condition problems. 

The EPA recommends that the grid cells be no 
larger than 5x5 km. The UAM grid size can be varied 
to reflect emissions gradients and density and should 
bear some association to the resolution of the input data. 
The resolution of meteorological data, of course, is 
much coarser than 5 x 5 km and air quality data resolu- 
tion can vary widely. 

3. MODEL SENSITIVITY ANALYSIS 

Model sensitivity issues can be roughly divided 
into universal or domain-specific categories. Chemical 
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mechanisms and reaction rates arc examples of issues 
that are relatively independent of the model application. 
In the latter group arc local effects such as unique 
meteorological or topographical conditions. In this 
paper domain-specific meteorological phenomena related 
to the model volume will be discussed. Results of ear- 
lier model sensitivity analyses with the UAM are con- 
tained in Seigneur, et al. 11981]; Dennis and Downton 
(1984]; Seinfeld [1988]. 

The UAM is vertically described by model layers 
broken up into two regimes, with differing diffusion 
characteristics, separated by the "diffusion break", or 
mixing height. The model layers expand and contract 
evenly with the height of the diffusion break. In gen- 
eral, the layers below the diffusion break arc assumed to 
be well-mixed with a homogeneous wind field and 
capped by an inversion. This idealized version of tur- 
bulent processes within Lhc PBL reflects the origin of 
the UAJvl in Los Angeles applications where a strong 
subsidence inversion is frequent ( Taylor and Marsh, 
1991). It is not clear that this concept is as physically 
accurate in the B-W domain and early sensitivity work, 
using a simplified data base in this domain, showed that 
peak 0 3 concentrations arc very sensitive to variations 
in the height of mixed layer. As a baseline, a 50% 
reduction in mixing height increased peak 0 3 by 33% 
(M. Jorquera , personal communication). 

The hourly evolution of the mixing height (model 
volume) is determined via interpolation from minimum 
and maximum mixing height values calculated by an 
EPA algorithm (Kelly. 1981). The EPA algorithm uses 
potential temperature profiles, calculated from 
radiosonde data, and a static stability analysis to deter- 
mine tire extent of buoyant overturning of surface-based 
parcels of air. Tins analysis assumes that the air near 
the surface cools adiabatically as it rises so that 0 is 
conserved. The maximum depth of the mixed layer is 
die height at which the ambient potential temperature 
(0,) is equal to or greater than surface potential tempera- 
ture (0 O ). 

TTiis method for estimating mixing heights raises 
several questions. First, the timing of the radiosonde 
launches (0000 Universal Coordinated Time (UTC) and 
1200 UTC) are early morning (0800 EDT) and early 
evening (2000 EDT) in the B-W domain and arc not 
optimal for observing either mid-day maximum or early 
morning minimum mixing height. To account for this, 
the EPA algorithm replaces the surface temperature at 
the lime of the sounding with the morning minimum 
and afternoon maximum surface temperature. The 
simulation of mid-day temperature profiles is reasonable 
given the relatively poor conductivity of air but will fail 
in conditions of synoptic-scale temperature advection. 
For a severe 0 3 event (July 6, 1988) using the 1200 
UTC and 0000 UTC soundings results in a 36% 
difference in maximum mixing height. 

Second, 0 profics arc most useful in dry environ- 
ments and in situations where bouyancy forces driven 
by the earth's heating dominate over larger scale forces, 
such as convergence associated with approaching fronts, 
which can induce upward motion. The eastern U.S. is 
quite moist in the summer months and the mixing ratio 
of water is not constant with altitude. In these eases, a 
measure which accounts for the presence of moist air, 
virtual potential temperature (0 V ), is more useful (Stull, 
1991). For a three day episode in July, 1988, the 0 V 
profiles of mixing height at three stations averaged 338 
m or 21.4% higher than the 0 profiles. 

Finally, mixing heights calculated from a specific 
sounding station must be extrapolated to other locations 
within the domain. Dulles International Airport (IAD) 
is the only upper air station within the B-W domain. 


The two nearest stations (Wallops Island, VA and 
Atlantic City, NJ) arc liable to coastal effects, especially 
in the planetary boundary layer. Horizontal interpola- 
tion is accomplished by a proportional variation in mix- 
ing height based on temperature differences, or the 
entire sounding can be applied to surface temperatures 
at other stations. This is a particular problem with 
respect to Chesapeake Bay and coastal locations within 
the B-W domain. Previous studies have shown that the 
mixing height can vary significantly in this area ( Segal 
etal.,1982). 

The sparseness of meteorological data and the 
sensitivity of the UAM to input parameters such as mix- 
ing height, and wind fields, suggest that regulatory 
applications of the UAM should be confined to 0 3 
episodes characterized by uniform meteorological fields. 
That is, the cases should be as "model- tractable" as pos- 
sible. 

4. EPISODE SELECTION 

EPA has recommended that several groups of his- 
torical severe Oj episodes be modeled with each group, 
or regime, representative of a "distinctly different 
source-receptor relationship" (EPA, 1991). The selec- 
tion of specific eases must then be made from die most 
severe O3 episodes in each regime. Identifying discrete 
transport regimes in the context of severe 0 3 episodes is 
a difficult process. Severe 0 3 events in the eastern U.S. 
arc typically associated with slow-moving, or stagnant, 
surface anticyclones (Vukovich et ai, 1977; Vukovich 
and Fishman, 1986). Light and variable surface winds 
arc associated with anlicyclonic flow, especially near the 
center of the anticyclone. As a result, the likelihood of 
strong surface wind signals for distinct types of O3 
events are slight. In the B-W domain, there is an addi- 
tional complicating factor in the strong bay breeze sig- 
nal associated with weak synoptic forcing (Schofield and 
Weiss, 1977). This effect is often not seen in the 
synoptic-scale observation network. Surface wind rose 
calculations carried out in accordance with EPA recom 
mendations show that for all types of conditions there is 
a single strong WSW signal. This suggests that there is 
only one transport regime associated with strong Oj 
events or that this method is not effective. 

In order to determine if there is a useful alterna- 
tive classification strategy, three approaches were under- 
taken (1) cluster analysis; (2) classification and regres- 
sion tree (CART) analysis; and (3) subjective meteoro- 
logical analysis of severe 0 3 events. The cluster 
analysis seeks to determine weather patterns using a set 
of meteorological data from a single location in a partic- 
ular season. The shortcoming of this approach is that 
weather patterns, expressed in terms of meteorological 
variables, are generally homogeneous and categorization 
results in a loss of information as eases are forced into a 
Procrustean bed of clusters. In this study, summer sea- 
son (June, July and August) data at Baltimorc- 
Washington International Airport (BW1) for the period 
from 1983-1990 was used. Surface data consist of four 
times daily (10, 14, 18 and 2200 UTC) pressure, tem- 
perature, dew point temperature, wind velocity and sky 
cover. Upper air data (850 and 700 mbar) from the 
closest upper air station (IAD), was also included and 
consist of 1200 UTC gcopotcntial height, temperature, 
moisture (expressed as dew point temperature) and wind 
velocity. 

The cluster analysis consisted of two steps: First, 
principal components analysis (PC A) was applied to the 
original data matrix (Kalksteln and Corrigan, 1986). 
The cluster analysis is performed on the matrix of com- 
ponent scores using the average linkage method (Kalk- 
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stein’ ct ai, 1987). As expected, the bulk of the total 
variance is determined by temperature, pressure and sky 
cover. Of the ten total clusters, two contain the bulk of 
the most severe 0 3 eases and are distinguished by hot 
and moist conditions (Cluster 3) or stagnant surface con- 
ditions (Cluster 9). In terms of source-receptor relation- 
ships, both clusters exhibit light WNW upper air winds. 

To further analyze local effects, the CART 
analysis was used ( California Statistical Software , 
1991). This type of analysis has been used for 0 3 stu- 
dies {llorie, 1987) and general weather forecasting (Bur- 
rows, 1991) and has been recommended for use in regu- 
latory applications ( National Research Council, 1991). 
The CART technique operates by a binary splitting of 
data into groups that arc more homogeneous ( Breiman , 
et ai, 1984). A succession of binary splits results in a 
’’tree’* whose final "branches", or terminal nodes, 
represent distinct classes, or categories of data. In this 
ease, the predictand is domain mean maximum 0 3 for 
the period JJA 1983-1990. The predictors consist of 
surface variables as in the cluster analysis. The 
observed upper air variables arc expanded to include 
data at 50 mbar intervals between 950 and 650 mbar. 

The majority of the strong 0 3 eases arc grouped 
in two of the five CART terminal nodes. The non-0 3 
nodes are easily filtered out by mid-day surface tem- 
perature of less than 86°F. The two high 0 3 nodes are 
distinguished by a linear combination dominated by 
midday temperature. The cases in the strongest 0 3 
node arc characterized by lower morning temperature, 
higher pressure and higher afternoon temperatures. 
Wind directions arc highly variable within each node 
and do not contribute significantly to differences at this 
level. Winds at the surface in tire strong 0 3 nodes arc 
generally west (W) with west-northwest (WNW) winds 
aloft. In terms of episode selection, the results of the 
CART analysis can separate the weak 0 3 eases but can- 
not further distinguish regimes within the high 0 3 cases. 

A second scries of runs was undertaken using a 
subset of 195 strong 0 3 eases in which domain mean 0 3 
exceeded 90 ppbv or in which the NAAQS was 
violated. In the cluster analysis the major components 
are essentially the same as in the initial run except that 
three additional components arc added that include com- 
binations of surface and upper air winds, afternoon sky 
cover and upper air temperatures. The two strongest 0 3 
sub-clusters arc similar to the major clusters above with 
one cluster characterized by extremely hot and moist 
conditions with WNW winds aloft and afternoon south- 
erly surface winds and a second sub-cluster character- 
ized by stagnant conditions. The remaining sub- 
clusters, which have less severe average 0 3 , but contain 
a smattering of severe cases, arc variations on the hot 
and moist sub-cluster with differences based on wind 
direction. 

A second CART analysis using the subset of 
strong 0 3 eases was also undertaken. The key variables 
responsible for the distinction between terminal nodes in 
this analysis arc upper air temperature and the 0 V gra- 
dient. The strongest, in terms of 0 3 , node is character- 
ized by high temperature at all levels with a weak gra- 
dient in 0 V . The remaining nodes arc characterized by 
strong 0 V gradients but cooler temperatures. An infer- 
ence that may be drawn from this distinction is that 
CART distinguishes between "hot” eases in which high 
temperature and low sky cover drive strong local 0 3 
chemistry and eases in which stable lower tropospheric 
conditions trap emissions into a smaller volume. 

The subjective analysis investigated multi-day 0 3 
episodes for the 1983-1990 period (N=159). Multi-day 
events represent the bulk (83%) of the extreme 0 3 cases 
from which the episodes must be selected. This 


analysis classified cases by the position of the nearest 
surface anticyclone, the position of surface fronts and 
the upper air transport pattern as evidenced by 850 mbar 
height patterns. This process was able to successfully 
classify 86% of the eases included in multi-day 0 3 
events and was more succesful (95%) with the most 
extreme twenty events. The subjective classification 
grouped the cases into four classes based mainly on 
differences in upper air height fields. Class 1 contains 
cases with northerly upper air winds and a surface anti- 
cyclone to the southwest Class 2 contains similar sur- 
face conditions, although with a higher incidence of lee 
troughs ( Pagnotti r 1987), but westerly winds aloft. 
Class 3 is characterized by a surface anticyclone to the 
SE, often near Cape Hatterras, and SW flow aloft. 

This class is typical of 0 3 events in the northeastern 
U.S. Finally, Class 4 contains cases in which stationary 
fronts, or slow-moving cold fronts arc present in, or just 
south of, the domain. 

When the subjective classifications are compared 
to the results of the cluster analysis using the subset of 
195 cases and focusing on the classification of the 
fifteen strongest 0 3 episodes there is some consistency. 
The sub-clusters with northerly winds and stagnant con- 
ditions arc both associated with Class 1 (northerly air 
(low) in the subjective analysis. The hot sub-cluster 
with WNW upper air and WSW surface winds is associ- 
ated with Class 2 and the sub-cluster with southwes- 
terly winds is associated with Class 3. A similar com- 
parison with the CART terminal nodes is less success- 
ful. The eases from Clashes 1 and 2 in the subjective 
analysis arc typically grouped together by the CART 
analysis. Cases with southerly air flow (Class 3) are 
successfully separated into another node by CART. 
However, when all multi-day events arc compared, there 
is very little coherence between the cluster, CART and 
subjective anlayscs. 

5. CONCLUSION 

Due to the unique topography of the B-W 
domain, variations in mixing heights arc expected to be 
large. The UAM is sensitive to variations in mixing 
heights and different analyses used to create mixing 
height fields result in significant variations. Given the 
UAM sensitivity to meteorological input fields, 0 3 
episodes to be analyzed for regulatory purposes should 
exhibit relatively uniform, or model-tractable, meteoro- 
logical fields. Clustering techniques and subjective 
meteorological analysis are able to provide some evi- 
dence of distinct weather regimes associated with .severe 
0 3 events in the B-W regime. The subjective analysis 
can also provide insight into the most model-tractable 
episodes. However, the EPA regulations relating to 
episode selection require that the final selections be lim- 
ited to the most extreme 0 3 events in each regime. 
These cases often exhibit unusual meteorological condi- 
tions such as lee trough formation. 
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Fig. 1. UAM domain for Baltimore -Washing ton region. 
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